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We derive a new family of convolutional character-error-correcting 
codes which are a convolutional form oj the Reed-Solomon block codes and 
as such have nonbinary symbols. We also derive a bound on the error cor- 
recting capabilities of these codes in which the error-correcting capability 
per constraint length grows approximately with the square root oj the 
constraint length. 

When these codes are used on a binary channel they are effective for both 
random and burst error correction because a single character spans several 
channel digits. 

These codes have greater error-correcting capabilities than the Robinson- 
Bernstein selj -orthogonal codes but are harder to decode. The single- 
character-error-correcting codes, when interleaved, are shown to be more 
powerful than the equivalent Hagelbarger code and appear to be simpler to 
implement. They are also slightly better than the interleaved version of 
Berlekamp's code. 

We discuss encoding and decoding algorithms and illustrate a simple 
decoding algorithm for some of the codes. These codes are closely related 
to the Bose-Chaudhuri-Hocquenghem block codes and share with them the 
decoding simplification for character erasures in place of errors. Any 
Bose-Chaudhuri-Hocquenghem decoding algorithm can be used to decode 
these codes. 

I. INTRODUCTION 

This paper is concerned with a family of character error correcting 
convolutional codes which are derived from Reed-Solomon block 
codes. 1 The derivation of the error correcting capabilities is easy be- 
cause of the algebraic nature of the code; the convolutional nature of 
the code allows the use of a simple encoder even at high rates. Also, 
sequential decoding techniques might be applicable. 

Throughout the paper we use elements of a finite field as symbols 
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instead of just 1 and 0. The elements can be represented by fc-tuples 
of ones and zeros if the field has 2 k elements; these k tuples are called 
symbols or characters. Thus with this code we are able to correct 
character errors which may be bursts of binary errors. All that one 
need know about finite fields is that each nonzero element has an in- 
verse and that there exists at least one element which when taken to 
successive powers will generate the entire field with the exception of 
the zero element. This is called a primitive element. 

The capability of the codes is given by the number of errors that 
can be corrected within a fixed number of characters (the constraint 
length). Suppose a code can correct three errors within a constraint 
length of 12. Then the code can correct any pattern of errors as long 
as no sequence of 12 characters has more than three errors in it. In 
the context of error correcting ability one can define a minimum dis- 
tance of the code. For linear codes the minimum distance (d), is 
equal to the minimum weight code word segment, one constraint 
length long, which has a nonzero first character. With this definition 
the code can correct up to (d-l)/2 errors occurring within one con- 
straint length. 

The rate of a code, R, is the fraction of characters used as informa- 
tion symbols. 

A convolutional code has its check symbols formed as a convolu- 
tion or weighted sum of a fixed number of the past information sym- 
bols. The weighted sum is formed in the algebra of the finite field. 

The codes described in Section II are capable of correcting t char- 
acter errors within a constraint length of (2t 2 — t + 1)/(1 — R) 
channel characters. The channel characters must be elements of a 
finite field of size at least [R(2t - 1) (t - 1) + 1]/(1 - R). If 
one uses as channel symbols elements of a much larger field it is pos- 
sible to construct a code which will correct t errors with a constraint 
length of only 2t/{l — R). Those codes can be encoded by a stand- 
ard convolutional encoder or by a number of accumulators which can 
also perform multiplication. Decoding can be accomplished by a 
modified Bose-Chaudhuri-Hocquenghem decoder. 

H. CONSTRUCTION OF THE CODES 

For a code of rate R = (b — l)/6 one can use every 6th symbol as a 
check symbol. To define the code completely one need only give the 
weights used in the convolution of past channel symbols. For con- 
venience we put these weights in an A'' by 6 = 1/(1 — R) matrix, B. B { ,,• = 
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Wi-i +bi (The constraint length is Nb.) If x t is a check symbol we write 
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Following Wyner and Ash we take the N by b matrix called B and form 
it into an infinite A matrix by shifting B to the right b places and 
down one place: 2 



A = 



Then by (1) any code sequence written as a vector x will satisfy 

Ax = 0. (2) 

We define the code by defining the elements of the matrix B. B is used 
instead of the weights W { because the notation is clearer. Denote the 
elements of B by B ti . Then let B ti — /3/y-" 1 where the 6 ?/s area , 
a, • • • , a h ~ 2 , 0. The symbol a is a primitive element of the field, and 
0° is taken as 1. The 7,'s are called locators. Let: /3,- = a } , 

where 



v li) = Z i = 



ci - m - 2) 



Thus for any b, N, and finite field a code is defined. 

As an example we take the special case which was presented by 
Wyner, where N = 2 and the By are given by 3 



(i-D(j-l) 



B lt = 



B = 




i 7* b 

i = b 

1 



(3) 



This code has d = 3 and thus can correct single errors, or double 
erasures in a constraint length of 26. It has the additional advantages 
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that it can be easily implemented, it is optimal,* and it has no error 
propagation. We describe, in Sections III through V, the implementa- 
tion and some properties of the single-character-correcting codes. 

III. DECODING TECHNIQUE FOR SINGLE CHARACTER CORRECTING CODES 

Single errors are particularly easy to correct because the first 
syndrome (difference between the calculated check character and the 
received check character) is equal to the error, and the second syn- 
drome is equal to the error multiplied by the error location. Since 
we take 

7. = a' _1 , * H b ^ 

7,- =0, i = b 

the error location i can be found by dividing the second syndrome S, 
by a and comparing the result to the first syndrome, So- This is 
repeated until they agree. The division can be implemented by a 
shift register whose feedback path corresponds to the coefficients of 
g (re) the generator polynominal of «, the primitive element.* 

TV. HARDWARE IMPLEMENTATION FOR SINGLE CHARACTER CORRECTING CODE 

The implementation is described through the example: symbols in 
GF{2 k ) k = 2, b = 4. The elements of GF{±) are represented as 
binary 2-tuples. Since x 2 + x + 1 is a primitive polynomial in GF{2), 
division by a can be instrumented by a two-state shift register with 
appropriate taps. The decoder takes form of Fig. 1. The entire re- 
ceived vector is shifted into the data buffer and then the syndromes 
So and Si are computed and stored in two registers R and B 1} respec- 
tively. So identifies the error pattern. If So is nonzero, it is assumed 
that a single character error of the pattern shown occurred. For each 
character shifted out of data buffer the R i register is shifted once 
with the feedback path connected (which corresponds to a division 
by a). The error pattern So in R matches that contained in register 
Ri when the erroneous character just emerges out of the data buffer. 
This character need only have S subtracted from it to complete cor- 
rection. It is possible to do all of this with simple logic circuitry and 
a storage of 26 k bits. 



*No other code, of the same constraint length, which corrects single errors, 
can have a higher rate (that is, fewer check symbols). 
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Fig. 1 — Decoder for single error correcting code. Go and Gi are timing signals: 
Go is high during the time information characters appear; Gi is high when parity 
character appears. 

V. PROPERTIES OF THE SINGLE CHARACTER CORRECTING CODE 

Observe that for eveiy b character one must decide, based on the 
current syndromes of 2k bits, the error pattern and the location of 
the error. Since there are b possible error locations one needs at least 
[\ogob] = k bits for identification if 2*- 1 < b, and as there are 2 k error 
patterns in a character (including the no-error pattern) that calls for 
k bits of information; thus the lower bound on the number of syn- 
drome bits is 2/c which shows the code is optimal* when b > 2* -1 . 

Notice that since the syndrome is reset to zero after correction (cor- 
responding to the removal of the effect of the error on syndrome) no 
error propagation is possible. 

Interleaving can be applied to this class of codes to achieve a class of 
near-optimal burst-error correcting codes. If the interleaving degree is 



* It is optimal in the number of check symbols. 
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in then two adjacent characters in the original code are separated by 
m — 1 characters or L = k{m — 1) binary digits. A burst of length L 
+ 1 bits will never affect two adjacent characters of the original codes; 
hence the interleaved code is capable of correcting an L + 1 bit burst. 
Since the original code has a storage requirement of 2bk bits, the in- 
terleaved code requires, at most, 2kbm bits. Observe that to correct a 
particular character one need not store the last character of the other 
ra — 1 interleaved codes; therefore the storage requirement is 2 bkm 
- (m - l)k = 2 bk(l + L/k) - L. 

The required guard space is simply one less than the constraint 
length.* This can be seen by observing that in correcting a burst the 
syndrome must be set to zero when the last character in the burst is 
corrected. It follows that the guard space must always be shorter 
than the constraint length. Table I compares some members of this 
class of codes with some Hagelbarger burst-error-correcting recur- 
rent codes as well as the Berlekamp burst-error-correcting recurrent 
codes (assuming Massey's decoding algorithm). 5-7 Table I shows this 
class of codes requires less guard space and hence is more powerful. 

Table I — Comparison of Burst Correcting Codes 
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* More precisely, the number of shift registers. 
Note: A - Hagelbarger codes. 

B - Berlekamp's type B2 burst-error correcting codes modified by inter- 
leaving. 

C - Single-character correcting codes modified by interleaving. 

Although the number of shift register stages is not an accurate meas- 
ure of decoder cost, it is seen that, except for the rate of %, Hagel- 
barger's codes generally cost more, especially at higer rates. Although 



* In our case the constraint length is equal to the storage requirement. 
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the interleaved single character correcting codes are slightly better 
than Berlekarap's code for correcting type Bl bursts, they are both the 
same (and optimal) for type B2 burst correction. 2 

VI. GENERALIZATION 

In order to demonstrate the minimum distance of codes with a 
longer constraint length we rely on the following lemma. 

Lemma: If the code with N = A r i ^ 1 has a minimum distance of at 
least d, , then the code with N a = N t -\- d, — 1 has a minimum distance of 
at least a\ + 1- 

The proof of this lemma depends on showing that no code word with 
di or less nonzero elements can satisfy Ax = 0, which is equivalent to 
showing that any d, columns of A form a matrix of rank at least di 
and thus equation (2) can be satisfied only by x = among all x with 
weight d, or less. 

Proof: By the assumption that the code with N = Ni has minimum 
distance d x , there must be at least di nonzero elements in the first NJ) 
symbols of x. Assume that there are just d x and no more, as well as no 
additional nonzero symbols in the rest of the constraint length N 2 . 
We write x' as a dj dimensional vector consisting of only the nonzero 
elements of x. Accordingly A'x' = 0, where A' is a N 2 by d x matrix with 
columns corresponding to the elements of x'. We must choose d x rows of 
A' which have a nonzero determinant. Assume for the moment that 
none of error locations are zero. We then choose the bottom d y of A'. 
We have a d r by d x array of nonzero elements. Each of the matrix 
element is guaranteed to be nonzero by the fact that all the nonzero 
elements of x lie within the first N x b elements and consequently A' can 
only have no zeros in the d,th or lower rows. 

We now take the d y by di array and divide each column by the first 
element. The first row is now all l's and the fcth row is 

M + * - i) _ /Wi7i**~' 

= ( a ' (6 - i > 7 .)*-y< i - i ><*- 4 >< 6 - i >/ 2 ' < (5) 

We next divide each row by a [{k ~' l)(k ~ i) ' 6 - 1 )/ 2 i thus obtaining a Van- 
der-Monde matrix. The determinant of the matrix cannot be zero since 
the quantity 



a 



.1(1.-1) 



7. (6) 
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is unique for each column. If q of the locators are zero, we choose the 
bottom d x — q rows and those q rows which correspond to the l's in the 
q zero locator columns. This determinant is nonzero by the same logic. 
Consequently, the equation A'x' = 0, can only be satisfied by x' = 0, 
which contradicts our assumption. Therefore, the assumption is untrue 
and the minimum weight code word which satisfies (2) must have 
weight di + 1 or larger, therefore providing the lemma. 

We now prove the general result by induction. When N = 1 the 
matrix 1 obviously has rank 1 and thus the minimum distance is 2. 
To obtain a minimum distance of 3 we need to add 2 — 1 = 1 to 6, 
thus N = 2. Each time we increase the minimum distance by 1 we 
increase AT by d — 1, thus 

i-i * 

2N - 4 = d(d - 3) for any value of d ^ 2. Since this is a lower bound 
on d we can be assured that for any d ^ 2 we can build a code with 
2N - 4 ^ d(d - 3). 

The field must be of sufficiently large size so that all the "locators" 
of (6) be different. This can be done if the field has at least {b - 1) 
[{d — 2) (d — 3) + 2]/2 nonzero elements. Thus for example we 
could build a code which corrects two errors (d = 5), has a rate of 
% (6 = 4) with a 16 element alphabet. The constraint length would 
be Nb = 7 X 4 = 28. This could be implemented by using 4-tuples as 
symbols with an overall constraint of 112 binary digits. 

VII. ALTERNATIVE CODE 

It is also possible to obtain a minimum distance of N + 1 if one uses 
only a certain set of symbols as locators. In the previous section one 
needed a field with at least (b - 1) [ (d - 2) (d - 3) + 2]/2 nonzero 
symbols. In this section we show that N can be made equal to (d — 1) 
if one is willing to use symbols from a much larger field. For this pur- 
pose we define B by: 
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. w -g.-- tf - 2 w - 1) 

where yt is the locator and (3 will be defined later. As in Section VI 
we wish to show that 

Ax = (7) 

only if x has d or more nonzero elements, given that there is at least 
one nonzero element among the first b. Suppose there exists a code word 
x with weight d — 1 or less such that (7) is met. Then take the N by 
d — 1 matrix consisting of the d — 1 columns of A corresponding to the 
nonzero components of x, and call it A'. The determinant of A' is a 
polynomial in /3, where the lowest power of /3 is no lower than that found 
along the main diagonal. This comes about because the right-hand 
columns are shifted down and thus contribute least to the power of /3 
in the bottom rows. The highest power of /3 is shown in the appendix to 
differ from the lowest power by no more than: 



[ff^M^] 



- 1 



If we call the difference between the highest and lowest power of 0, r, 
then the determinant can only be zero when /3 is a root of an rth order or 
smaller polynomial with coefficients from the field containing the 7/s. 
Consequently we can choose /3 from an r -f- 1 order extension field such 
that it is a root of an irreducible r + 1 order polynomial. Then it cannot 
be the root of a polynomial of degree r or less, and the determinant 
cannot be zero. Consequently x = for (7) to be met. 

We have now defined a as a member of the field with q r+1 elements 
where GF(q) is the locator field. In other words our symbols are 
taken from a <j r+1 element field and the locators a f are confined to the 
q element subfield. For example, suppose we require a double-error- 
correcting code (N = 4) with a rate %. We use the four element field 
as locators, and since r + 1 = 4 the symbols must come from a 4 4 = 
256 element field. This results in an overall constraint length of 
8 X 4 X 4 = 128 bits. 

VIII. IMPLEMENTATION 

The encoding can be accomplished by the standard convolutional 
encoder. The decoding presents the same problems as the decoding of 



* [•] is the symbol for the integer part of 
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a nonbiliary Bose-Chaudhuri-Hocquenghem code. One takes increas- 
ing estimates of the number of errors within a constraint length and 
solves for the error locations and values. If the estimate is incorrect 
there is no consistent solution for the error values. The problem of 
error propagation can be easily handled by introducing a second set 
of syndrome calculators which are not changed when errors are cor- 
rected. A guard space equal to the constraint length, with no errors, 
will then produce zeros in all these syndromes, indicating that there 
have been no errors in the last constraint length. 

In both coding and decoding, symbols must be added and multi- 
plied. This is easy to implement when the symbols are represented as 
binary k tuples. Addition is just bit by bit modulo two addition; 
multiplication by a (the primitive element) is accomplished by a 
linear shift register. 4 

IX. CONCLUSIONS 

We have presented a class of algebraic convolutional codes. These 
codes can be compared to other convolutional codes, such as Robinson 
and Bernstein's self-orthogonal codes. 8 The Robinson-Bernstein codes 
are binary error correcting codes which have a constraint length bounded 
by N ^ [(2t 2 — t)(b — 1) + l]b. Our codes have a constraint length 
bounded by N ^ (2t 2 — t + 1)6, which is less than the above for 
6^2. Ignoring the fact that our code is a character error correcting 
code, our code is more powerful than the Robinson-Bernstein code. 

If one takes account of the fact that characters are several bits long, 
one can still make a comparison by taking the constraint length of our 
code as 

N £ (2t 2 - t + 1)6({ n 2 [(b - l)(2t 2 - St + 1) + &]} + 1), 

[•] integer of. 

One now finds that our code is more powerful than the Robinson- 
Bernstein code for sufficiently large b. More specifically, if t 2 ^ 2 /166 
our code is more powerful. The Robinson-Bernstein codes have the 
advantage that they can be decoded by majority logic devices while we 
require much more complex devices. 

The codes described in this section may be useful when one desires 
a high rate code. Here the amount of storage needed at the encoder 
can be made equal to the redundancy per constraint length. The 
decoder must store the entire constraint length in order to make cor- 
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rections but the part of storage needed to compute the error locations 
and values is only equal to the redundancy. The disadvantage is that 
the decoder must perform finite field multiplication, division, and 
addition. 

The code is also capable of correcting erasures; decoding is much 
simpler in this case. Therefore it may be advantageous to let the 
k tuples (which represent symbols) be code words in a binary sub- 
code. The subcode then could be used to detect binary errors (produc- 
ing an erasure), and the convolutional code used to correct these 
erasures. This is a convolutional version of Forney's concatenated 
block codes. 

The decoding algorithm is also simple for any code of distance four 
or less. This includes the single-error correcting code, the double- 
erasure correcting code, and the single-error-plus-single-erasure code. 
The decoding complexity is comparable to that of Hamming codes. 

APPENDIX 

Bound on the Powers of /? 

We wish to bound the difference between the maximum and minimum 
power of /3 in the determinant of A'. The matrix will be made up of 
columns taken from B or columns from B shifted downward. This 
shifting downward of columns is the only parameter which effects our 
bound. Therefore we take Z, as the amount that the ith column is 
shifted; the ith. column is headed bj r Z, zeros. By ordering we make Z, a 
nondecreasing function of i. We can assume that Z< < i; otherwise this 
would put a zero on the main diagonal and produce a zero determinant. 
One could then take the largest nonsingular upper left minor A" and 
write A"X" = 0. The only difference here is that A" has smaller size; 
but the bound will still be valid. 

The determinant of the matrix is given by 

e n &«.,«, (8) 

where a is a permutation of the numbers 1 to N. The term b iiC{i) is 
zero if <x(i) ^ Z, ; otherwise it contains /3 to the \[<r{i) — Z< — 2] 
• ["(*) — %i ~ 1] } /2 power. For any given u the product of (8) contains 
/3 to the 

\ £ [a(i) - Z,. - 2][*(i) - Z< - 1] (9) 
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power, provided that for all i <r(i) > Z ( . Equation (9) can be re- 
written 

J S A*) - Mi) - 2®Z< + ( z < + 2 X Z < + !)• ( 10 ) 

i-i 

The only term which depends on <r is 

- £ cr(i)Z< . (11) 

i = i 

The minimum power of p in (8) is given by (10) when we minimize 
(11). Since Z { is nondecreasing the term which minimizes (11) is 
clearly *(i) = i. Furthermore, the only other terms of (8) which con- 
tribute to this lowest power of (3 come from o-'s which permute the i's 
over regions where Z { is constant. 

The difference between the maximum power of /3 and the minimum 
power is 

D - 2 iZ< - Min E ff(i)Z t 

i = l r.9(i)>Zi t-1 

= Max E [i - cfflZt = E (t - * m iu(i)]Zi . (12) 

a,ali)>Z{ i-1 •'-! 

The minimization in (12) is accomplished by assigning the smallest i 
to the largest Z t , subject to the constraint that <r{i) > Z t . This can be 
done by starting with Z N and assigning the smallest possible integer 
to (t{N), namely Z N + 1. The term Z*-i then receives the next small- 
est integer which is still free, and so on. This is the minimum as any 
other acceptable <t can be converted into this <r min by a sequence of 
pairwise permutations which do not increase the sum. In Fig. 2 we 
show an example of the assignment on a square array where the 0's 
indicate the a which minimizes the sum and the x's that which maxi- 
mize it [<r(i) = i]. The 0's in Fig. 2 are not exactly as described above 
but are permuted somewhat above equal value Z { 's for reasons which 
will be clarified in this appendix. 

We now show that we can increase various Z t 's to Zj's so that A< = 
Z' i+1 — Z\ ^ 1 (in Fig. 2 we increase Z 4 , Z s and Z 8 ) and D will not de- 
crease. When we change the Z, , a min (i) does not change. When A,- ^ 1, 
our construction of <r mia (i) gives <x min (i) = Z i+l < i + 1; thus <r min (i) ^ i, 
and i - <r min («) ^ 0. Once we have the new Z\ with Z' i+i . - Z{ ^ 1 we 
write D' ^ Din terms of a sequence 

M' = Z' i+ i ~Z\= (■• 
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Fig. 2 — Bounds on the minimum and maximum powers of p in the determinant 
of A. 
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then 



z'< = Z a; , 

D' - £ [tf - *<t)]Z5 = £ [i - «] £ a; 

i - 1 ■ = 1 

- £ A; £ [< - er(t)] 



j-1 



, - 1 i - 1 + 1 



= z 4 

i-i L 



(AT - j)(N + j + 1) 



£ »G>1- 

i-l + 1 J 



(13) 



We observe that our choice of <r(i) gives 

£ *(*) " 2^ * = [AT - j]\ t> 

,-,+1 .-Z/+2 L ■* 



when A' = 1, 



and (13) becomes 



£ Aj(iV -i)(; - i-z,). 



For each value of Z,(0, 1, 2, • • •) there can be only one nonzero A' t ; if 
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one takes K to be the number of nonzero A< , then 

D> =Z(N- /*)(/* - 1 - k), 

where /* is that value of ; for which Z, = k and AJ = 1. We now show 
that 

Z (N - j k )(h - 1 - 7c) ^ E (N - t)(/o - D- 

The terms on the right are all nonnegative and for every k on the left 
there is an i on the right with i = f k . For that term /* ^ / + &> as /* 
cannot increase faster than k. 

U ~ 1 ~ fc ^ /o ~ 1, 

Ctf - «(/. - 1 - fc; ^ (iV - /,)(/o - 1). 

D'*£(N- i)(U - 1) = (/„ - 1) (^-/nA^-/o + D . (14) 
The integer value of / which maximizes this is 

U = I jj-J + 1, [•] = integer part of- 
and 



- * \flH\ 



-1- (15) 



3 
This bound on D can actually be achieved for any N. 
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